Learning rewrite rules to improve plan quality
Author
Abstract
Considerable planning and learning research has been devoted to the problem of learning domain-specific search control rules to improve planning efficiency. There have also been a few attempts to learn search control rules that improve plan quality, but such efforts have been limited to state-space planners. The reason is that most of the newer planning approaches are based on plan refinement, and in such planners the information about the current state of the world that is required to evaluate a complex quality metric is simply not available during planning. An alternative technique is planning by rewriting, which suggests first generating an initial plan using a refinement planner and then using a set of rewrite rules to transform it into a higher-quality plan (Ambite & Knoblock 1997). Unlike search control rules, which are defined on the space of partial plans, rewrite rules are defined on the space of complete plans. This paper presents a system called REWRITE that automatically learns rewrite rules. REWRITE has three main components. The first is a partial-order causal-link planner (POP). The second component does the analytic work of identifying the replacing and to-be-replaced action sequences. The third component is a case library of plan-rewrite rules. The input to REWRITE's analytic component is (a) a problem described by an initial state and goals, (b) the plan and planning trace produced by the partial-order planner for this problem, and (c) a "better plan" for the same problem. The better plan is one that has a higher quality rating than the plan produced by the underlying partial-order planner, according to the quality function that assesses how resources are impacted by each plan. This better plan might be provided by some oracle, by a user, or by some other planner. REWRITE's analytic component first reconstructs a set of causal-link relationships between the steps in the better plan and a set of required ordering constraints.
The second step is to retrace POP's planning trace, looking for plan-refinement decisions that added a constraint that is not present in the better plan's constraint set. We call such a decision point a conflicting choice.
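The trace-retracing step described above can be illustrated with a minimal sketch. It is not REWRITE's actual implementation: the `CausalLink`, `Ordering`, and `Decision` types, the `conflicting_choices` function, and the toy travel steps are all hypothetical names introduced here, and the sketch assumes that constraints from the trace and from the better plan can be compared by simple equality.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Union


# Hypothetical constraint encodings, chosen only for this illustration.
@dataclass(frozen=True)
class CausalLink:
    producer: str   # step that establishes the condition
    condition: str  # condition protected by the link
    consumer: str   # step that needs the condition


@dataclass(frozen=True)
class Ordering:
    before: str
    after: str


Constraint = Union[CausalLink, Ordering]


@dataclass(frozen=True)
class Decision:
    """One plan-refinement decision in the planner's trace (assumed shape)."""
    index: int
    added: FrozenSet[Constraint]  # constraints this decision added to the plan


def conflicting_choices(
    trace: List[Decision], better: FrozenSet[Constraint]
) -> List[Decision]:
    """Return decisions that added a constraint absent from the better plan."""
    return [d for d in trace if any(c not in better for c in d.added)]


# Toy example: the planner chose to drive, the better plan flies.
better = frozenset({
    Ordering("start", "finish"),
    CausalLink("fly", "at(destination)", "finish"),
})
trace = [
    Decision(0, frozenset({Ordering("start", "finish")})),
    Decision(1, frozenset({CausalLink("drive", "at(destination)", "finish")})),
]
conflicts = conflicting_choices(trace, better)
print([d.index for d in conflicts])
```

In this toy trace only the second decision is flagged, because its causal link through `drive` does not appear in the better plan's constraint set. A real system would also need to match steps across the two plans, which this sketch sidesteps by using identical step names.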
Similar resources
Cost-Based Learning for Planning
Most learning in planners to date has been focused on speedup learning. Recently the focus has been more on learning to improve plan quality. We introduce a different dimension: learning not just from failed plans, but learning from inefficient plans. We call this cost-based learning (CBL). CBL can be used both to improve plan quality and to provide speedup learning. We show how cost-based learnin...
Learning Rewrite Rules versus Search Control Rules to Improve Plan Quality
Domain-independent planners can produce better-quality plans through the use of domain-specific knowledge, typically encoded as search control rules. The planning-by-rewriting approach has been proposed as an alternative technique for improving plan quality. We present a system that automatically learns plan rewriting rules and compare it with a system that automatically learns search control ru...
Learning Plan Rewriting Rules
Considerable work has been done to automatically learn domain-specific knowledge to improve the performance of domain-independent problem solving systems. However, most of this work has focussed on learning search control knowledge, that is, knowledge that can be used by a problem solving system during search to improve its performance. An alternative approach to improving the performance of domain indep...
Learning to Improve Plan Quality
Adaptive automated planning systems that can, over time, improve the quality of plans they produce are a promising prospect. The first part of the article discusses the issues involved in designing quality improving learning for planning systems and reviews recent work on learning to improve plan quality. The second part describes our work on the Performance Improving Planning (PIP) System. The...
Learned rewrite rules versus learned search control rules to improve plan quality
Domain-independent planners can produce better-quality plans through the use of domain-dependent knowledge, typically encoded as search control rules. The planning-by-rewriting approach has been proposed as an alternative technique for improving plan quality. We present a system called Sys-REWRITE that automatically learns plan rewriting rules and compare it with Sys-SEARCH-CONTROL, a system t...
Journal title:
Volume, issue:
Pages: -
Publication date: 1999